8 research outputs found

    Article Segmentation in Digitised Newspapers

    Get PDF
    Digitisation projects preserve and make available vast quantities of historical text. Among these, newspapers are an invaluable resource for the study of human culture and history. Article segmentation identifies each region in a digitised newspaper page that contains an article. Digital humanities, information retrieval (IR), and natural language processing (NLP) applications over digitised archives improve access to text and allow automatic information extraction. The lack of article segmentation impedes these applications. We contribute a thorough review of the existing approaches to article segmentation. Our analysis reveals divergent interpretations of the task, and inconsistent and often ambiguously defined evaluation metrics, making comparisons between systems challenging. We solve these issues by contributing a detailed task definition that examines the nuances and intricacies of article segmentation that are not immediately apparent. We provide practical guidelines on handling borderline cases and devise a new evaluation framework that allows insightful comparison of existing and future approaches. Our review also reveals that the lack of large datasets hinders meaningful evaluation and limits machine learning approaches. We solve these problems by contributing a distant supervision method for generating large datasets for article segmentation. We manually annotate a portion of our dataset and show that our method produces article segmentations over characters nearly as well as costly human annotators. We reimplement the seminal textual approach to article segmentation (Aiello and Pegoretti, 2006) and show that it does not generalise well when evaluated on a large dataset. We contribute a framework for textual article segmentation that divides the task into two distinct phases: block representation and clustering. We propose several techniques for block representation and contribute a novel highly-compressed semantic representation called similarity embeddings. We evaluate and compare different clustering techniques, and innovatively apply label propagation (Zhu and Ghahramani, 2002) to spread headline labels to similar blocks. Our similarity embeddings and label propagation approach substantially outperforms Aiello and Pegoretti but still falls short of human performance. Exploring visual approaches to article segmentation, we reimplement and analyse the state-of-the-art Bansal et al. (2014) approach. We contribute an innovative 2D Markov model approach that captures reading order dependencies and reduces the structured labelling problem to a Markov chain that we decode with Viterbi (1967). Our approach substantially outperforms Bansal et al., achieves accuracy as good as human annotators, and establishes a new state of the art in article segmentation. Our task definition, evaluation framework, and distant supervision dataset will encourage progress in the task of article segmentation. Our state-of-the-art textual and visual approaches will allow sophisticated IR and NLP applications over digitised newspaper archives, supporting research in the digital humanities

    Metronomic Chemotherapy for Advanced Prostate Cancer: A Literature Review

    No full text
    Metastatic castration-resistant prostate cancer (mCRPC) is the ultimately lethal form of prostate cancer. Docetaxel chemotherapy was the first life-prolonging treatment for mCRPC; however, the standard maximally tolerated dose (MTD) docetaxel regimen is often not considered for patients with mCRPC who are older and/or frail due to its toxicity. Low-dose metronomic chemotherapy (LDMC) is the frequent administration of typically oral and off-patent chemotherapeutics at low doses, which is associated with a superior safety profile and higher tolerability than MTD chemotherapy. We conducted a systematic literature review using the PUBMED, EMBASE, and MEDLINE electronic databases to identify clinical studies that examined the impact of LDMC on patients with advanced prostate cancer. The search identified 30 reports that retrospectively or prospectively investigated LDMC, 29 of which focused on mCRPC. Cyclophosphamide was the most commonly used agent integrated into 27/30 (90%) of LDMC regimens. LDMC resulted in a clinical benefit rate of 56.8 ± 24.5% across all studies. Overall, there were only a few non-hematological grade 3 or 4 adverse events reported. As such, LDMC is a well-tolerated treatment option for patients with mCRPC, including those who are older and frail. Furthermore, LDMC is considered more affordable than conventional mCRPC therapies. However, prospective phase III trials are needed to further characterize the efficacy and safety of LDMC in mCRPC before its use in practice

    Management and outcomes following an acute coronary event in patients with chronic heart failure 1999-2007

    No full text
    AIM: The outcome of patients with chronic heart failure (CHF) following an ischaemic event is poorly understood. We evaluated the management and outcomes of CHF patients presenting with an acute coronary syndrome (ACS) and explored changes in outcomes over time. METHOD AND RESULTS: A total of 5556 patients enrolled in the Australia-New Zealand population of the Global Registry of Acute Coronary Events (GRACE) between 1999 and 2007 were included. Patients with CHF (n = 609) were compared with those without CHF (n = 4947). Patients with CHF were on average 10 years older, were more likely to be female, had more co-morbidities and cardiac risk factors, and were more likely to have a prior history of angina, myocardial infarction, and revascularization by coronary artery bypass graft (CABG) when compared with those without CHF. CHF was associated with a substantial increase in in-hospital renal failure [odds ratio (OR) 1.76, 95% confidence interval (CI) 1.15-2.71], readmission post-discharge (OR 1.47, 95% CI 1.17-1.90), and 6-month mortality (OR 2.25, 95% CI 1.55-3.27). Over the 9 year study period, in-hospital and 6 month mortality in those with CHF declined by absolute rates of 7.5% and 14%, respectively. This was temporally associated with an increase in prescription of thienopyridines, beta-blockers, statins, and angiotensin II receptor blockers, increased rates of coronary angiography, and 31.8% absolute increase in referral rates for cardiac rehabilitation. CONCLUSIONS: Acute coronary syndrome patients with pre-existing CHF are a very high risk group and carry a disproportionate mortality burden. Encouragingly, there was a marked temporal improvement in outcomes over a 9 year period with an increase in evidence-based treatments and secondary preventative measures.Ranasinghe Isuru, Naoum Chris, Aliprandi-Costa Bernadette, Sindone Andrew P., Steg P. Gabriel, Elliott John, McGarity Bruce, Lefkovits Jeffrey, Brieger David, and on behalf of the Australia-New Zealand GRAC

    On the role of aggregation effects in the performance of perylene-diimide based solar cells

    No full text
    A model bulk-heterojunction of a perylene diimide (PDI) monomeric derivative is studied for interrogating the role of PDI aggregates in the photocurrent generation efficiency (ηPC) of PDI-based organic photovoltaic (OPV) devices. Blend films of the PDI derivative and the poly(indenofluorene) (PIF) polymer annealed between room temperature and 220 °C, are used as the photoactive layers for the fabrication of OPVs. The positive effect of thermal annealing is assigned to the evolution of PDI aggregates in the amorphous PIF matrix. Annealing increases the electron mobility by three orders of magnitude. In contrast, owned to the thermally inert PIF matrix used, hole mobility increases only by a factor of six. High resolution cross-sectional scanning electron microscopy suggests that ηPC in PDI-based OPVs is not limited by the PDI aggregates but by their improper alignment. In situ Raman spectra and density functional theory calculations identify a marker for monitoring the strength of π-π stacking interactions between PDI monomers. It s further demonstrated that the electron-collecting electrode of the PIF:PDI devices dictates their performance. The use of Al is found to impede charge extraction and this is attributed to an unidentified product of the reaction between PDI and Al that leads to the formation of an electron-blocking layer. Device performance rectifies when a Ca/Al electrode is used and the power conversion efficiency is increased by a factor of four. © 2014 Elsevier B.V. All rights reserved

    CardioScape mapping the cardiovascular funding landscape in Europe.

    No full text
    Aims: The burden of cardiovascular disease is increasing worldwide, which has to be reflected by cardiovascular (CV) research in Europe. CardioScape, a FP7 funded project initiated by the European Society of Cardiology (ESC), identified where CV research is performed, how it is funded and by whom. It could be transformed into an on-line and up-to-date resource of great relevance for researchers, funding bodies and policymakers and could be a role model for mapping CV research funding in Europe and beyond. Methods and results: Relevant funding bodies in 28 European Union (EU) countries were identified by a multistep process involving experts in each country. Projects above a funding threshold of 100 k€ during the period 2010-2012 were included using a standard questionnaire. Results were classified by experts and an adaptive text analysis software to a CV-research taxonomy, integrating existing schemes from ESC journals and congresses. An on-line query portal was set up to allow different users to interrogate the database according to their specific viewpoints. Conclusion: CV-research funding varies strongly between different nations with the EU providing 37% of total available project funding and clear geographical gradients exist. Data allow in depth comparison of funding for different research areas and led to a number of recommendations by the consortium. CardioScape can support CV research by aiding researchers, funding agencies and policy makers in their strategic decisions thus improving research quality if CardioScape strategy and technology becomes the basis of a continuously updated and expanded European wide publicly accessible database.European Union FP7 research programm
    corecore